Towards High-dimensional Data Analysis in Air Quality Research
Abstract
Analysis of chemical constituents from mass spectrometry of aerosols involves non-negative matrix factorization, an approximation of high-dimensional data in lower-dimensional space. The associated optimization problem is non-convex, resulting in crude approximation errors that are not accessible to scientists. To address this shortcoming, we introduce a new methodology for user-guided, error-aware data factorization. It entails an assessment of the amount of information contributed by each dimension of the approximation, an effective combination of visualization techniques to highlight, filter, and analyze error features, and a novel means to interactively refine factorizations. A case study and the domain-expert feedback provided by the collaborating atmospheric scientists illustrate that our method effectively communicates the errors of such numerical optimization results and facilitates the computation of high-quality data factorizations in a simple and intuitive manner.
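To make the factorization and its error concrete, here is a minimal sketch of non-negative matrix factorization in plain Python. It is not the method described in the abstract: it assumes the standard multiplicative update rules for the squared Frobenius error, and the toy matrix `X`, the function names, and the iteration count are all hypothetical choices for illustration. The entry-wise residual is the kind of error information that normally stays hidden from the analyst, and the error at each rank `k` gives a rough sense of how much each added dimension contributes.

```python
import random

def matmul(A, B):
    """Plain-Python matrix product of two lists of rows."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def nmf(X, k, iters=500, seed=0, eps=1e-9):
    """Approximate non-negative X (n x m) as W H, with W (n x k) and H (k x m).

    Assumption: multiplicative updates for the squared Frobenius error.
    The result depends on the random start because the problem is non-convex.
    """
    rng = random.Random(seed)
    n, m = len(X), len(X[0])
    W = [[rng.random() + eps for _ in range(k)] for _ in range(n)]
    H = [[rng.random() + eps for _ in range(m)] for _ in range(k)]
    for _ in range(iters):
        # H <- H * (W^T X) / (W^T W H), element-wise
        Wt = transpose(W)
        num, den = matmul(Wt, X), matmul(matmul(Wt, W), H)
        H = [[H[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(k)]
        # W <- W * (X H^T) / (W H H^T), element-wise
        Ht = transpose(H)
        num, den = matmul(X, Ht), matmul(W, matmul(H, Ht))
        W = [[W[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(n)]
    return W, H

def residual(X, W, H):
    """Entry-wise error X - W H."""
    approx = matmul(W, H)
    return [[x - a for x, a in zip(rx, ra)] for rx, ra in zip(X, approx)]

def frobenius_error(X, W, H):
    """Single-number summary of the residual."""
    return sum(e * e for row in residual(X, W, H) for e in row) ** 0.5

# Hypothetical toy data: 5 samples x 4 channels, built from two non-negative sources.
X = [[1, 2, 0, 1],
     [0, 1, 3, 1],
     [1, 3, 3, 2],
     [2, 4, 0, 2],
     [0, 3, 9, 3]]

for k in (1, 2, 3):
    W, H = nmf(X, k)
    print(f"rank {k}: Frobenius error = {frobenius_error(X, W, H):.4f}")
```

Rerunning `nmf` with a different `seed` can produce different factors and a different residual pattern for the same `k`, which is one simple way to observe the effect of non-convexity.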
Similar resources
Methods for regression analysis in high-dimensional data
With advances in science, knowledge, and technology, new and precise methods for measuring, collecting, and recording information have been developed, which has led to the emergence and growth of high-dimensional data. A high-dimensional data set, i.e., a data set in which the number of explanatory variables is much larger than the number of observations, cannot be easily analyzed by ...
dimensional film dosimetry with GAFCHROMIC films for quality assurance and dosimetric verification of 3D conformal radiotherapy in the presence of heterogeneities
Introduction: The presence of heterogeneities such as air-filled cavities in head and neck treatment fields may result in dosimetric disagreement because of the loss of charged particle equilibrium. Most treatment planning systems are not able to accurately predict the dose distribution in inhomogeneous regions. Therefore, dose calculation algorithms need to...
Combined Cluster Analysis and Principal Component Analysis to Reduce Data Complexity for Exhaust Air Purification
Anthropogenic and demographic processes cause air problems worldwide, prompting a focus on exhaust air purification to counteract these effects. Due to the large number of substances found in exhaust air and the various operational parameters needed, a huge amount of often high-dimensional data has to be analyzed. The ultimate goal is to reduce data complexity in terms of information...
Synoptic analysis of highly polluted days in the city of Mashhad, case study: 13 and 14 November 2007
For a synoptic analysis of the highly polluted days of 13 and 14 November 2007, a combinatorial synoptic analysis was used. In terms of methodology, the present study used the "circular environment" synoptic approach and, with respect to the restrictions on very highly polluted days in Mashhad city, the subjective synoptic analysis used for data processing and analyzing the prevailin...
Analysis of Air Pollution
This research paper is an attempt at analyzing real-time air pollution data collected by PAQS sensor devices from some key locations in Bangalore. Air pollution in most of the metropolitan cities in India is becoming a major threat to our environment and a hazard to our health. Many infections and diseases of the lungs and throat are caused by the polluted air we breathe. The...
Journal: Comput. Graph. Forum
Volume: 32
Publication date: 2013